Robustness against separation and outliers in logistic regression
نویسندگان
چکیده
The logistic regression model is commonly used to describe the e,ect of one or several explanatory variables on a binary response variable. It su,ers from the problem that its parameters are not identi/able when there is separation in the space of the explanatory variables. In that case, existing /tting techniques fail to converge or give the wrong answer. To remedy this, a slightly more general model is proposed under which the observed response is strongly related but not equal to the unobservable true response. This model will be called the hidden logistic regression model because the unobservable true responses are comparable to a hidden layer in a feedforward neural net. The maximum estimated likelihood estimator is proposed in this model. It is robust against separation, always exists, and is easy to compute. Outlier-robust estimation is also studied in this setting, yielding the weighted maximum estimated likelihood estimator. c © 2002 Elsevier Science B.V. All rights reserved.
منابع مشابه
Some New Estimation Methods for Weighted Regression When There Are Possible Outliers
The problem of estimating the variance parameter robustly in a heteroscedatic linear model is considered. The situation where the variance is a function of the explanatory variables is treated. To estimate the variance robustly in this case, it is necessary to guard against the influence of outliers in the design as well as outliers in the response. By analogy with the homoscedastic regression ...
متن کاملComparison of ordinary logistic regression and robust logistic regression models in modeling of pre-diabetes risk factors
Background: Regarding the increased risk of developing type 2 diabetes in pre-diabetic people, identifying pre-diabetes and determining of its risk factors seems so necessary. In this study, it is aimed to compare ordinary logistic regression and robust logistic regression models in modeling pre-diabetes risk factors. Methods: This is a cross-sectional study and conducted on 6460 people, over ...
متن کاملUsing Classic Discriminant Analysis and Detection Function for Separation of Chemical Victims in Sardasht City to Exposed and Non Exposure Mustard Groups in 2013 and Comparison with Logistic Regression
متن کامل
Robust and Sparse Regression via γ-Divergence
In high-dimensional data, many sparse regression methods have been proposed. However, they may not be robust against outliers. Recently, the use of density power weight has been studied for robust parameter estimation, and the corresponding divergences have been discussed. One such divergence is the γ-divergence, and the robust estimator using the γ-divergence is known for having a strong robus...
متن کاملRobustness of reweighted Least Squares Kernel Based Regression
Kernel Based Regression (KBR) minimizes a convex risk over a possibly infinite dimensional reproducing kernel Hilbert space. Recently it was shown that KBR with a least squares loss function may have some undesirable properties from a robustness point of view: even very small amounts of outliers can dramatically affect the estimates. KBR with other loss functions is more robust, but often gives...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 43 شماره
صفحات -
تاریخ انتشار 2003